[webgpu] support Pad operator #23141
base: main
Conversation
/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline
/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline
/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models
Azure Pipelines successfully started running 2 pipeline(s).
/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline
Azure Pipelines successfully started running 4 pipeline(s).
Azure Pipelines successfully started running 3 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
@fs-eire @guschmue Please help to trigger the bots again. The last version failed on macOS but compiled correctly on Windows. I have changed the code, but have not yet verified that it works correctly on macOS. The compile error is shown below.
/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline
/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline
/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models
Azure Pipelines successfully started running 2 pipeline(s).
/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline
Azure Pipelines successfully started running 4 pipeline(s).
Azure Pipelines successfully started running 3 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
.InputMemoryType(OrtMemTypeCPUInput, 1) \
.InputMemoryType(OrtMemTypeCPUInput, 2) \
.InputMemoryType(OrtMemTypeCPUInput, 3) \
.TypeConstraint("T", DataTypeImpl::GetTensorType<T>()), \
It seems you needn't have bothered with all the specialized stuff if you had used WebGpuSupportedNumberTypes() like this.
Pad is a template class, so it needs to pass the template type when it is registered. I am not sure whether WebGpuSupportedNumberTypes() works correctly here. I referred to the CUDA EP.
Hi @fs-eire, @jchen10 prefers to register the kernel using WebGpuSupportedNumberTypes(), inferring the type of padValue from the input element type when running the kernel and adding the uniforms dynamically, as in main...jchen10:onnxruntime:tmp. I use a template class only because I want to do it the same way as other EPs, taking the CUDA EP as an example. What are your comments here?
@guschmue @fs-eire
My proposal is just an alternative way to get the uniform type at runtime, so that we don't need to bother with the specialized template kernel class registrations. It's just a minor change. If it's not beneficial enough in your view, let's keep the current solution and unblock this PR. Feel free to comment; I am okay either way.
Based on today's understanding, I would suggest using an untemplated class, which optimizes for binary size. I vote for using WebGpuSupportedNumberTypes().
The CUDA EP uses a template class because nvcc can use that information to simplify the implementation. However, for WebGPU we are shader based, so the compiler does not really take advantage of the template type.
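For reference, a minimal sketch of what the untemplated registration could look like, assuming the existing WebGpuSupportedNumberTypes() helper and the usual onnxruntime kernel registration macros; the opset version and the single `Pad` class name are illustrative, not taken from this PR:

```cpp
// Sketch only: register one untemplated Pad kernel whose "T" constraint is
// driven by WebGpuSupportedNumberTypes() instead of per-type template
// specializations. The opset version and class name are placeholders.
ONNX_OPERATOR_KERNEL_EX(
    Pad,
    kOnnxDomain,
    19,
    kWebGpuExecutionProvider,
    (*KernelDefBuilder::Create())
        .InputMemoryType(OrtMemTypeCPUInput, 1)   // pads
        .InputMemoryType(OrtMemTypeCPUInput, 2)   // constant_value
        .InputMemoryType(OrtMemTypeCPUInput, 3)   // axes
        .TypeConstraint("T", WebGpuSupportedNumberTypes()),
    Pad);
```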
/azp run Win_TRT_Minimal_CUDA_Test_CI
Azure Pipelines successfully started running 1 pipeline(s).
const PadsVector* p_pads = &pads_;
const PadsVector* p_slices = &slices_;
WebGpuT value = ToWebGpuType<T>::FromFloat(value_);
I would recommend avoiding the f32 -> f16 conversion of the value here.
When value_ is used, it means the model is a very old one: only opset 10 and below take "value" from the attributes, and the type of the attribute "value" is always float.
On opset >= 11, the value comes from the 3rd input (i.e. inputs[2]), and its type matches the input data (i.e. inputs[0]).
My suggestion is to always use a u32 uniform to carry the value:
- for opset <= 10, the value of this uniform is always the bitwise representation of the float number
- for opset > 10, the value of this uniform is always the bitwise representation of the corresponding type T (padding 2 bytes of 0 for f16)
Inside WGSL, use a type cast or bitcast to get the const value.
This makes it easier to implement an untemplated class.
It also makes it easier to support Android/iOS in the future, considering most mobile devices do not support f16 in uniforms yet.
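As a rough illustration of that suggestion (not code from this PR): on the host side the pad value could be packed bitwise into a u32 uniform, and the shader would bitcast it back. The helper name, the uniform name, and the WGSL lines in the comments are assumptions for the sketch:

```cpp
// Sketch only: carry the constant pad value as a bitwise u32 uniform.
// "PackPadValue" and "constant_value_bits" are illustrative names.
#include <cstdint>
#include <cstring>

uint32_t PackPadValue(bool is_f16, float value_f32, uint16_t value_f16_bits) {
  uint32_t bits = 0;
  if (is_f16) {
    // opset > 10 with an f16 input: low 16 bits hold the f16 bit pattern,
    // the upper 16 bits stay 0.
    bits = static_cast<uint32_t>(value_f16_bits);
  } else {
    // f32 input, or the opset <= 10 "value" attribute: copy the raw bits.
    std::memcpy(&bits, &value_f32, sizeof(value_f32));
  }
  return bits;
}

// In WGSL (illustrative, assuming the f16 extension is enabled), the shader
// recovers the typed constant from the uniform:
//   let v_f32 : f32 = bitcast<f32>(uniforms.constant_value_bits);
//   let v_f16 : f16 = bitcast<vec2<f16>>(uniforms.constant_value_bits).x;
```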